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Abstract 



Chevron rollovers of some proteins imply that their logarithmic folding 
rates are nonlinear in native stability. This is predicted by lattice and con- 
tinuum Go models to arise from diminished accessibilities of the ground state 
from transiently populated compact conformations under strongly native con- 
ditions. Despite these models' native-centric interactions, the slowdown is due 
partly to kinetic trapping caused by some of the folding intermediates' non- 
native topologies. Notably, simple two-state folding kinetics of small single- 
domain proteins are not reproduced by common Go-like schemes. 



PACS Numbers : 87.15.Aa, 87.15. Cc, 87.15.He, 87.15.By 
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The physical basis of protein folding is a central unresolved puzzle in molecular biology. 
Recently, much advance in protein folding has originated from experiments on small single- 
domain proteins [1] with simple two-state folding and unfolding kinetics typified by that 
of CI2 [2], with features including: (i) single-exponential relaxation, (ii) the logarithmic 
folding and unfolding rates (In fcf and In k^) at constant temperature being essentially linear 
in chemical denaturant (urea or GuHCl) concentration, i.e., both arms of the "chevron plot" 
[3] are linear, and that (iii) the equilibrium ratio of native to denatured conformational 
population K = [native] /[denatured] = h/k-a- What form of intrachain interactions might 
give rise to such remarkable behavior is a question of fundamental biophysical interest. 

Other proteins' corresponding properties are more complex. Often this is manifested 
by significant deviations [4-9] from the above linearities, i.e., they exhibit chevron rollovers 
[10]. We refer to their kinetics as non-two-state. Examples of such behavior include barnase 
[4,8], ribonuclease A [5], hen lysozyme [6], and UlA [7]. The present operational definition 
of non-two-state kinetics encompasses what some authors called "two-state" (though not 
"simple") when conditions (i) and (iii) above are satisfied but not (ii) [7,8]. Chevron rollover 
can also be brought about by mutation, as in S6 [7] and BPTI [9]. Thus, rather than an 
aberration, chevron rollover is quite ubiquitous. Therefore, ascertaining its physical origins 
should provide important clues to protein energetics. 

Chevron rollovers have been attributed to peculiarities of intermediates or transition 
states on postulated free energy profiles [4-8,10], or front factors' sensitivity to folding condi- 
tions [11]. Yet these phenomenological considerations do not pinpoint the physical processes 
involved. In this Letter, physical mechanisms underlying chevron rollovers are addressed 
directly by examining a multitude of trajectories from several protein chain models. 

The recent discovery of a remarkable correlation between contact order and folding rates 
of simple two-state proteins [12] has led to extensive studies of Go-like protein models [13- 
17]. Hence, a natural question is whether common Go- like constructs do predict simple 
two-state kinetics. Somewhat surprisingly, our investigation thus far indicates that this may 
not be the case. Instead, chevron rollover emerges as a conspicuous feature in both lattice 
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[11] and continuum [17] Go models. This suggests that common native-centric [16] chain 
constructs can be useful for elucidating the polymer mechanisms of chevron rollovers, even 
though they may not be entirely adequate for simple two-state proteins. Pursuing this logic, 
we now analyze a thermodynamically cooperative [16] 48mer three-dimensional lattice Go 
model [14]. This model had notable impact on recent appraisals [18] of the energy landscape 
views of protein folding [19], but its chevron behavior has not been investigated. 

Each native contact in this model has a favorable energy e (< 0), nonnative contacts 
have zero energy. Folding/unfolding kinetics are modeled by Metropohs Monte Carlo (MC) 
dynamics with the same set of elementary chain moves as in [14]: End moves are attempted 
for the two chain-end monomers. Corner and crankshaft moves are attempted for other 
monomers with 70% and 30% probability respectively. Time is measured by the number of 
attempted MC moves; Q is fractional number of native contacts [15,17]. 

The chevron plots for this Go model and a closely related model are shown in Fig. 1. 
When relaxation is single-exponential (see below), ki or k-a — 1/MFPT. Most of the MFPTs 
here are averaged from at least 1,000 trajectories, except for a narrow e/k-BT range around 
the transition midpoint where kinetics are relatively slow (100-200 trajectories each) and 
for folding initiated from Fig. 2(d) (200 trajectories each). In these models, the free energy 
AGu of unfolding to the open conformations {Q < 6/57) is essentially linear in e/ksT. Thus 
we model denaturant concentration changes by varying e/k-eT [11,20]. Adding repulsive 
nonnative contact energies to a Go model [20,21] does not appear to have a significant 
impact on the chevron behavior. Fig. 1 shows dramatic chevron rollovers of the folding 
arms and very slight rollovers of the unfolding arms for both models, with maximum folding 
rates at AG^/kBT— 14.2 (Go) and 16.2 (Go plus repulsion). Fig. 1 indicates that deviations 
from simple two-state behavior can be difficult to discern under weakly native conditions [22]. 
To facilitate comparison with experiments, we characterize folding rollover by the difference 
between the hypothetical simple two-state Inkf"^ (inclined dotted fines in Fig. 1) and tfie 
actual (simulated) folding rate In/cf — In(MFPT) at three representative values of native 
stability AG^, spanning a range typically covered by real proteins. Here, for AGu/Zcer = (5, 
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10, 15), the logarithmic rollover ratio ln{ki-yk{) = (0.16, 1.08, 2.96) for the Go model and 
(0.32, 1.36, 3.12) for the model with repulsive interactions. These ratios are not dissimilar 
to the corresponding ln{k'^~^/kf) ~ (0.77, 2.42, 4.28) for wildtype barnase at 25°C and pH 
6.3 [4]. Under these conditions, a maximum folding rate was not observed for barnase [4]. 
However, if a quadratic dependence [7,23] of In kf vs. denaturant is assumed for barnase (c.f. 
[8]), a maximum folding rate ~ 230 s~^ may be extrapolated to occur at an hypothetical 
AGu ~ 'iOksT which is much more stable than the AGu ~ IS-OksT at zero denaturant. 

Fig. 2(a) provides the Go model's conformational distributions at different native sta- 
bilities. Under mildly native conditions {AG^ < IO/crT), the free energy profiles have a 
barrier between the native and denatured minima. Under more strongly native conditions 
(AGu > IS/cbT"), their shapes are suggestive of downhill folding [24]. The analysis of first 
passage time distribution [16,25] in Fig. 2(b) indicates that folding kinetics is approximately 
single exponential [In P{t) linear in t] under mildly native conditions, consistent with the ob- 
served single-exponential folding kinetics for ribonuclease A when double-jump experiments 
were used to eliminate the effect of cis/trans proline isomerization [5]. The behavior of wild- 
type barnase is similar: Folding is fast and single-exponential for the majority of the chains 
(si 80%), the rest belongs to a slow-folding tail caused by prohne isomerization [4]. However, 
when modeling conditions are strongly native (corresponding conditions may not always be 
experimentally achievable [11]), folding kinetics is not single-exponential [circles and squares 
in Fig. 2(b)]. The onset of this behavior occurs approximately when the — ln[P(Q)] profile 
becomes downhill and where folding rate is maximum (c.f. Figs. 1 and 2) [11]. It would be 
instructive to ascertain whether this specific model prediction applies to real proteins. 

A closer examination of the model folding trajectories indicates that the slowdown leading 
to folding-arm chevron rolloves arises from transiently populated compact non-ground- state 
conformations because these folding "intermediates" have lifetimes that increase with in- 
creasingly native conditions (Figs. 1, 3 and 4). Examples are shown in Fig. 3(b-d). Once 
one of these structures is adopted under strongly native conditions (e/k^T > 1.55), it takes 
longer on average to reach the ground state if intrachain native contacts are more favorable 
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(Fig. 1). Folding trajectories under strongly native conditions are qualitatively different 
from that under milder conditions [Fig. 4(a-c)]. At the transition midpoint [Fig. 4(c)], in- 
terconversions between Q ^ 0.2 and Q ~ 1.0 are sudden and sharp. Relatively little time 
is spent at intermediate Q values. As e/k-oT increases, however, certain conformations with 
intermediate Q ~ 0.6-0.8 are frequented more. Even under mildly native conditions, their 
impeding effects on folding kinetics is already apparent from the event in Fig. 4(b) depicting 
the chain bounces back to Q ^ 0.2 after achieving Q ^ 0.8. But the lifetimes of these "in- 
termediates" are brief compared to that of the open unfolded conformations. Hence folding 
remains approximately single-exponential [triangles in Fig. 2(b)]. However, when conditions 
become more strongly native [Fig. 4(a)], some folding trajectories are dominated by "inter- 
mediates," leading to a significant reduction in average folding time. But even under these 
circumstances it is still possible to fold quickly [Fig. 4(a), inset]. Consistent with Fig. 2b 
(circles), this separation of time scales means that folding is no longer single-exponential. 

In contrast to a previous report that no "entangled misfolded state" was observed during 
the folding of this particular Go model [14], Fig. 3(b) exhibits an overall nonnative topol- 
ogy. For Fig. 3(c), the left side of the conformation is native, but the right side is substan- 
tially nonnative. Hence these conformations are kinetic traps in that they cannot reach the 
ground state without first open up somewhat by breaking some existing favorable contacts. 
Notwithstanding possible artifacts of lattice models [Fig. 3(d)], this basic physical require- 
ment rationalizes folding-arm chevron rollover because favorable contacts contributing to 
the meta-stability of these traps are increasing difficult to break with stronger —e/k-QT. 

This prediction appears to be robust over a range of lattice and continuum coarse-grained 
models [Fig. 4(d-e)] that exhibit chevron rollovers. Fig. 4(d) shows that folding of a recent 
lattice model with residue-based as well as native-centric interactions are similar to that in 
Fig. 4(b). Fig. 4(e) shows that folding of a continuum Cq, model under mildly and strongly 
[inset in (e)] native conditions are very much similar, respectively, to the corresponding 
lattice results in Fig. 4(b) and (a). The trajectories in Fig. 4(f) from a continuum model 
[17] with desolvation barriers [26] also show that intermediate Q values are more prominent 
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during the folding process when conditions are more strongly native [inset in (f)]. 

A maximum (or "optimal" ) folding rate similar to those in Fig. 1 have been observed in 
many models (e.g., [19,20,21,23,25]) since it was first noted in HP model simulations more 
than a decade ago [27]. This feature arises from a competition between a stronger driving 
force for folding and the onset of glassy dynamics under strongly native conditions (see note 
added in proof of [20]). However, until recently [11,17], the connection between this theoret- 
ical phenomenon and chevron rollover has not been recognized. Perhaps this is because the 
maximum folding rate often occurs near the transition midpoint for less cooperative mod- 
els, and hence its relationship with chevron plots is less obvious. In contrast, the models 
studied here possess proteinlike thermodynamic cooperativity [16]. Several basic principles 
now emerge: (i) Rollovers can arise from kinetic trapping [6,9,20]; but folding relaxation 
remains approximately single exponential when trapping effects are mild [Fig. 2(b)]. (ii) We 
have rationalized rollovers phenomenologically by front factors that depend on AG^ [11,17]. 
Physically, this dependence is likely caused by trapping and unfolding (barrier recrossing) 
from transiently populated compact non-ground-statc conformations (Fig. 4). These pre- 
dictions are testable by experiments, (iii) The chevron rollovers in the Go models presented 
suggest strongly that, contrary to expectation. Go-like pairwise additive interactions are in- 
sufficient [17] to capture the remarkable kinetics of small single-domain proteins [1]; further 
research is necessary to ascertain the physical origin of their simple two-state cooperativity. 
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Fig. 1 MFPT is mean first passage time, UbT is Boltzmann constant times absolute 
temperature. Circles are for the 48mer Go model [14] . Squares are data from a model with 
the same ground-state conformation and attractive energy e for each native contact but with 
an additional repulsive energy — e for each nonnative contact (as in the HP-|- model [20] and 
Ref. [21]). Folding (open symbols) starts from a random self-avoiding walk, first passage 
is achieved when Q = 1. Unfolding (filled symbols) starts from the Q = 1 ground state, 
first passage is achieved when Q < 6/57 because the free energy minimum of the denatured 
states is at Q f=i 6/57. Sohd dashed curves are mere guide for the eye. The vertical dotted 
lines mark the two model's thermodynamic transition midpoints. The two pairs of V-shape 
dotted lines are hypothetical simple two-state chevron plots [11] based upon AGu between 
Q — 1 and Q < 6/57 obtained by histogram techniques from sampling around the transi- 
tion midpoint. The triangles, asterisks, and diamonds are — In(MFPT) values for Go-model 
folding initiated (at t — 0) respectively from the compact conformations (b), (c), and (d) in 
Fig. 3. Arrows indicate the e/ksT values considered in Fig. 2. 



Fig. 2 (a) Free energy profiles for the Go model at the e/Zce^ values indicated (c.f. 
Fig. 1). P{Q) is Boltzmann population distribution over Q. [Note that P{Q) — ior Q — 
55/57 and 56/57.] (b) P(t)At is the fraction of folding trajectories with t — At/2 < first 
passage time <t + At/2 [17], plotted in different horizontal scales for different e/ksTs [sym- 
bols as in (a)] to enhance clarity. For e/kBT — —1.82, —1.61, —1.47, and —1.28 respectively, 
2,500, 2,030, 3,500, and 1,100 trajectories are analyzed using At/lO^ = 30, 1.8, 1.6, and 30; 
the ln[P{t)At] shown are for t values equal to 1, 1/10, 1/20, and 1/2 of that given by the 
horizontal axis. Solid and dashed lines are linear fits for e/k-^T — —1.47 and —1.28. 



Fig. 3 (a) Ground state of the Go model, (b-d) Transiently trapped conformations under 
strongly native conditions (c.f. Fig. 1), with Q — 39/57, 41/57 and 53/57 respectively. The 

dotted, dashed, dottcd-dashed lines and filled circles are used to identify monomers (in all 
four conformations) that belong to three of the straight edges and the core positions in the 



ground-state conformation (a). The conformation in (d) may reach the ground state by a 
simple hinge motion of the dotted-dashed edge. However, since such a move is not available 
in the model [14], the chain now must first partially open up before it can access the ground 
state. The trapping effect of (d) is minor compared to that of (b) and (c) (see Fig. 1). 

Fig. 4 (a-c) Folding trajectories of the 48mer Go model at e/k-QT = —1.82 (a), —1.47 
(b), and —1.28 (c). The insets in (a, b) each shows a faster folding trajectories at the same 
given e/ksT. (d-f) Typical folding trajectories in other models for comparison: (d) is from a 
55mer lattice model under mildly native conditions {e/k-oT = —1.75) [11]. (e, f) are from the 
continuum NCSl without-solvation (e) and with-solvation (f) Langevin dynamics models at 
T = 0.82 for CI2 [17] with e = 0.88 (e) and 1.1 (f). The insets in (e, f) show trajectories at 
the same T but under more strongly native conditions at e = 1.0 (e) and 1.5 (f). 
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